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Abstract 

Let t be a rooted tree and n,(f) the number of nodes in t having i children. The degree sequence 
{ni{t),i > 0) of t satisfies X^j>o'^j(t) = 1 + X^i>o^'^i(*) = 1*1' where \t\ denotes the number of 
nodes in t. In this paper, we consider trees sampled uniformly among all plane trees having the same 
degree sequence s; we write Pg for the corresponding distribution. Let s{k) = (^^(k), i > 0) be a list of 
degree sequences indexed by k corresponding to trees with size — > +00. We show that under some 

simple and natural hypotheses on (s(fi;), k > 0) the trees sampled imder Ps(k) converge to the Brownian 

1/2 

continuimi random tree after normahsation by . Some apphcations concerning Galton- Watson trees 
and coalescence processes are provided. 

1 Introduction 

Let t be a rooted tree and ni{t) the number of nodes in t having i children. The sequence {ni(t),i > 0) 
is called the degree sequence of t, and satisfies J2i>o i^iit) = 1 + Yli>o ii^iit) = 1*1' number of nodes 
in t. 

The aim of this paper is to study trees chosen under Pg, the uniform distribution on the set of plane 
trees with specified degree sequence s = (nj, i > 0), and then size |s| := J2i>o More precisely, a 
sequence of degree sequences (s(k), k > 0) with s{k) = {ni{K),i > 0), corresponding to trees with size 

:= |s(«;)| — > +00 is given, and the investigations concern the Umiting behaviour of tree under Ps(k). 




Figure 1: The 10 trees of Tg for the degree sequence s = (3, 1, 2, 0, 0, . . . ). 
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We now introduce some notation valid in the entire paper. We denote by p{k) = (pi{K),i > 0) the 
degree distribution under Ps(k)' 

P^{k) = (1) 

Let also 

2 ^ y n^^2 _ 

^ - 1 ' 

i>i 

(T^ is "almost" the associated variance, this choice of definition yields shorter formulae in the following. 
The maximum degree of any tree with degree sequence s{k) is 

Aft = max{i : ni{K,) > 0}. 

Throughout the paper p = {pi,i > 0) is a distribution with mean 1, and variance Up G (0, +oo) = 
Si>o ^^P* ~ 1 £ (Oi oo). In the following theorem, which is the main result of the present paper, p(At) =^ p 
means equivalence in distribution, which here means that for any i > 0, Pi{K) — )• pi, as k — )• oo. 

1 /2 

Theorem 1. Let (s(k),k > 0) be a sequence of degree sequences such that — s- +oo, = 0(11^ ), 
p(k) =^ p with (T^ —7- dp, that is convergence of second moment. Let t be a plane tree chosen under 

— 1/2 

^s(k) '^^^ dt be the graph distance in t. Under Ps(k)' when k — )■ +cxd, {t,ai^n^ dt) converges in 
distribution to Aldous ' continuum random tree ( encoded by twice a Brownian excursion ), in the Gromov— 
Hausdorjf sense. 

First observe that the very strong result of Haas and Miermont [26] about the asymptotics of Markov 
branching trees that has been used to give asymptotics for random trees in a wide variety of settings does not 
apply in the present case of trees with a prescribed degree sequence. Indeed, the subtrees of a given node 
are not independent given their sizes when one fixes the degree sequence. Our approach uses instead the 
observation done by Marckert and Mokkadem [37] that all natural encodings of the trees are asymptotically 
proportional in the case of Galton-Watson trees conditioned by the size. The same property will also hold 
here. In particular, the height process or the contour process both encoding the metric structure of the tree 
resemble the depth-first queue process encoding the sequence of degrees observed when performing a depth- 
first traversal. This fact was used by Marckert and Mokkadem [37] to give an alternative proof of Aldous' 
result in the case of Galton-Watson trees conditioned on the total progeny under some moment condition 
(Bennies and Kersting [13] also observed this phenomenon). 

One of the crucial questions underlying our work is that of the universality of the convergence of random 
trees to the continuum random tree (CRT). 

We are motivated by the metric structure of graphs with a prescribed degree sequence. Introduced by 
Bender and Canfield [12] and by Bollobas [20] in the form of the configuration model, these graphs have 
received a lot of attention since the first tight analysis of the size of connected components by Molloy and 
Reed [39, 40]. This is mainly because the model allows for a lot of flexibility in the degree sequence. In 
particular, the model provides a construction of random graphs with degree sequences that may match the 
observations in large real-world networks. 

Of course, random graphs with a prescribed degree sequence are much more complex than trees with 
a prescribed degree sequence, but there is no doubt that the analysis of trees is a first step towards the 
identification of the metric structure of the corresponding graphs. Indeed, recent results of Joseph [32] 
show that under some moment condition, the sizes of the connected components of random graphs with a 
prescribed critical degree sequence are similar to those of Erdos-Renyi G(n, p) random graphs [2 1 , 24, 30] : 
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they may be asymptotically described in terms of the lengths of the excursions of a Brownian motion with 
parabolic drift above its current minimum, as demonstrated by Aldous [9]. (See also [45], where it is 
supposed that the maximum degree is bounded.) On the other hand, the metric structure of G{n,p) inside 
the critical window has recently been identified in terms of modifications of Brownian CRT by Addario- 
Berry, Broutin, and Goldschmidt [2, 3]. In other words, the present analysis is one more building block 
towards an invariance principle for scaling limits of random graphs, i.e., that critical random graphs with a 
prescribed degree sequence have (under a suitable moment condition on the degree distribution) the same 
scaling limit (as sequence of compact metric spaces) as classical random graphs [3]. This is at least what 
is suggested by the results of Bhamidi, van der Hofstad, and van Leeuwaarden [18], van der Hofstad [47], 
Joseph [32] and Riordan [45]. 

Moreover, in the same way that uniform random trees or forests may be seen as the results of coagu- 
lation/fragmentation processes involving particles [42, 43], trees with a prescribed degree sequence appear 
naturally in similar aggregation processes. The model where particles have constrained valence may ap- 
pear more "physically" grounded. The relevant underlying coalescing procedure is the additive coalescent 
[10, 15], a Markov process whose dynamics are such that particles merge at a rate proportional to the sum of 
their masses/sizes. The additive coalescent is the aggregation process appearing in Knuth's modification of 
Renyi's parking problem [28, 44] or the hashing with linear probing [17, 22]. The reader may find more in- 
formation about coagulation/fragmentation processes in the monograph by Bertoin [16] or the recent survey 
by Berestycki [14]. 

The model Pg is related to Galton-Watson trees [11, 27], also called simply generated trees in the 
combinatorial literature, by a simple conditioning: the distribution Pg coincides with the distribution of the 
family tree t of a Galton-Watson process with offspring distribution > 0) (which must satisfies f « > 
if rii > 0) conditioned on {nj(t) = ni,i > 0}. Indeed, Pg assigns the same probability to all trees with the 
same degree sequence. In this sense, the distribution v plays a role of secondary importance, and Pg appears 
to be a model of combinatorial nature, far from the world of Galton-Watson processes. Nevertheless, we 
will see that Theorem 1 implies the following result of Aldous (stated in a slightly different form in [6]) (see 
also [6-8, 34, 37]), where Ht is the height process of t (the definition is recalled in the next section). 

Proposition 2 (Aldous [6]). Let fi = > 0) be a distribution with mean = 1 and variance 

G (0, +oo), and let P^ be the distribution of a Galton-Watson tree with offspring distribution fi. Along 
the subsequence {n : P^{\t\ = n) > 0}, under ■ | |t| = n) 



where e denotes a standard Brownian excursion, the convergence holding in the space C[0, 1] equipped with 
the topology of uniform convergence. 

We will see that this theorem may be seen indeed as a consequence of Theorem 1 ; the argument morally 
relies on the fact that under P^( . | |t| = n), the empirical degree sequence satisfies the hypotheses of 
Theorem 1 with probability going to 1 (this is stated in Lemma 11). The proof of this theorem is postponed 
until Section 6. 

Note also results of Rizzolo [46] and Kortchemski [33] that have a flavor similar to our Theorem 1 
(although neither implies the other): they proved that Galton-Watson trees conditioned on the number of 
nodes having their degrees in a subset A of the support of the measure /x has a limiting behaviour depending 
on A. For instance, they consider trees conditioned on the number of leaves, the number of nodes with other 
out-degrees being left free. The proofs in Rizzolo [46] rely ultimately on the approach based on Markov 
branching trees developed by Haas and Miermont [26]. 
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Plan of the paper. In Section 2 we introduce precisely the model of trees we consider. Section 3 is de- 
voted to a useful backbone decomposition for these trees. We then prove our main result, the convergence of 
rescaled trees to the continuum random trees, in Section 4. Finally, the application to coagulation processes 
with particles with constrained valence is developed in Section 7. 

2 Trees with prescribed degree sequence 

We here define formally the combinatorial object discussed in this paper. For convenience we write 
N = {1, 2, . . . } for the set of positive natural numbers. First recall some definitions related to standard 
rooted plane trees. Let U = Un>o finite words on the alphabet N, where N'^ = {0}, and 

denotes the empty word. Denote by uv the concatenation of u and v; by convention 0u = u0 = u. 

A subset T of is a plane tree (see Figure 2) if 

• it contains (called the root), 

• it is stable by prefix (if uv ^ T for u and v inU, then u € T), and 

• if (uk G T for some k > 1 and u ^ U) then uj € T for j in {1, . . . , k}. 

This last condition appears necessary to get a unique tree with a given genealogical structure. The set of 
plane trees will be denoted by T. 
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6v 6 
Figure 2: Usual representation of the plane tree {0, 1, 11, 12, 13, 14, 15, 131, 151, 152} 

Notice that the lexicographical order < onU, also named the depth-first order, induces a total order on 
any tree t; this is of prime importance for the encodings of t we will present. For t G T, and u ^ t, let 
ct{u) = max{z : ui £ t} be the number of children of u in t. The depth of u in t, its number of letters as 
a word in U, is denoted \u\. The notation \t\ refers to the cardinality of t, its number of nodes including the 
root 0. 

With a tree t £ T, one can associate its degree sequence s{t) = {ni{t),i > 0), where ni{t) = i^{u G 
t : ct{u) = i} is the number of nodes with degree i in t. For a fixed degree sequence s, write Tg for 
the set of trees t € T such that s{t) = s, and let Pg be the uniform distribution on Tg. To investigate 
the shape of random trees under Ps> we will use the usual encodings: height process H and depth-first 
walk S (or Lukasiewicz path) and contour process C. These encodings are defined by first fixing their 
values at the integral points, and then linear interpolation in between (See Figure 3). For a tree t G T, let 
ui = < U2 < • • • < u\t\ denote the nodes of t sorted according to the lexicographic order. Then we define 
H = Htby H{i) = |ni+i|, 5 = S'f by St{i) = Y^\=i{ct{uj) — 1); the process Ht is defined on [0, |t| — 1] and 
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St on [0, \t\]. For the contour process Ct of t, we need to define first a function ft : {0, ... , 2{\t\ — 1)} i— )• t 
which can be regarded as a waUc around t; first set /t(0) = 0, the root. For i < 2{\t\ — 1), given ft{i) = v, 
ft{i + 1) is u, the smallest child of v (for the lexicographical order) absent from the list {/t(0), . . . , ft{i)} , 
and the father of v if no such u exists. The contour process has the following values on integer positions 

Ct{i) = \ft{i)\, iG{0,...,2{\t\-l)}. 



Figure 3: A plane tree t G T, its height process Ht, Lukasiewicz walk St and its contour process Ct- 
Theorem 3. Under the hypothesis of Theorem 1, under Ps(k)' 

in distribution in the space C([0, 1], M'^) of continuous functions from [0, 1] with values in W^, equipped with 
the supremum distance. 

The contour process is a kind of interpolation of the height process. The fact that both these processes 
have the same asymptotic behaviour is well understood in some general settings : it is shown in Marckert 
and Mokkadem (Lemma 3.19 [31]) that, if under any model of random trees, the height process has a 
continuous limit after a non trivial normalisation, then the contour process has the same limit with the same 
space normalisation (and time normalisation multiplied by 2 to take into account the relative durations of 
these processes). This property has been noticed before in the case of Galton-Watson trees conditioned by 
the size [13, 37]. 

As a consequence (of Lemma 3.19 [31]), to establish 

/ Ht{x{n^-l)) 5t(xn«)\ / 2 



1/2 ' 1/2 / ,^oo' \a "'""P"] 

is sufficient to deduce (3). 

Note now that the condition dp > is necessary in Theorem 3: it ensures that po = limK-s>oo "^o (z^) > 
and that large trees are not close to a linear tree, where most of the nodes have degree one. 

A tree t e T can also be seen as a metric space when equipped with the graph distance dt. A consequence 
of Theorem 3 is that, under Ps(k)' the metric space 

t, -T=^t 
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converges to the continuum random tree encoded by 2e in the sense of Gromov-Hausdorff distance between 
equivalence classes of compact metric spaces. The fact that the convergence of the contour process (or the 
height process) implies the convergence of the trees for the Gromov-Hausdorff topology is well known, see 
for example Lemma 2.3 in Le Gall [35]. So, in particular, to prove Theorem 1 it suffices to prove Theorem 3 
and for this, it is sufficient to prove (4). 

Remark. One can define other models of random trees with a prescribed degree sequence: for example, 
rooted labelled trees. Let <I2s(fc) be the uniform distribution on those with degree sequence s(A;). Since 
labelled trees have a canonical ordering (using an order on the labels to order the children of each node), 
forgetting the labels, they can be seen as plane trees with the same degree sequence, inducing a distribution 
^s(fc) plane trees. By a simple counting argument, it turns out that IF's(fc) = ^s{k)- This situation 

is drastically different from the general case, since the projection of uniform labelled trees on plane tree 
(that is without fixing the degree sequence) does not induce the uniform distribution on plane trees. As a 
consequence, Theorem 1 is also valid for the model of labelled trees with a prescribed degree sequence. 



3 Combinatorial considerations: a backbone decomposition 

In this section we develop a decomposition of trees under Ps(fc) along a branch. It is essentially the 
usual backbone decomposition for Galton-Watson trees due to Lyons, Pemantle, and Peres [see, e.g., 36] 
transposed under Ps(fc)- The decomposition amounts to describing the structure of the branch from the root 
to a distinguished node u, together with the (ordered) forest formed by the trees rooted at the neighbours of 
that branch. 

Forest with a given degree sequence. A forest f = (ii, . . . , t^) is a finite sequence of trees; its 
degree sequence s(f) = X]i=i ^{ti) is the (component-wise) sum of the degree sequences of the trees which 
compose it. If s = (rij, z > 0) is the degree sequence of a forest f, then the number of roots of f is given by 
r = |s| — X]i>o ^'^^^ L^*^ b^ forests of (r ordered) plane trees having degree sequence s. 

We have (see, e.g., [42], p. 128) 

|s| V(rii,i > 0)7 |s| ni>o™i'' 

The content of a branch. Let t be a plane tree, and let u = ii . . . be one of its nodes, where ij G N 
for any j. For j < write Uj = ii . . .ij, the ancestor of u having depth j (with the convention uq = 0, 
the root of t). The set [0, uJ = {uj : j < \u\} is called the branch of u (notice that u is excluded). For any 
i > 0, the number of ancestors of u having i children is written 

Mi{u, t) = #{v : V strict ancestors of u, ct{v) = i}. 

We refer to M(u, t) = {Mi{u, t),i > 0) as the composition of the branch. Note that we necessarily have 
Mo(n, t) = 0. Clearly if u e t, then 

\u\ =^Mi{u,t) = \M{u,t)\. (6) 

i>l 
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Further let LR(u, t) (for left or right) be the set of nodes that are children of some node in [0, u} without 
being themselves in [[0,n]]; note that because of our convention for [[0,'u]], u belongs to LR(u,t) (see 
Figure 4). Let also R{u, t) be the subset of LR{u, t), of nodes lying to the right of the path [0, nj (therefore 




Figure 4: A tree t with a marked node u; the sets in the two right-hand side pictures show the sets R(n, t) 
and LR(M,t). 

u ^ R(u, t)). A node v is in R(u, t) if it is a child of some Ui, for i G {0, . . . , |u| — 1}, and satisfies v > Wj+i 
in the lexicographic order on U. Therefore 

H-i 

|LR(n,t)| = ictiuj)-l) + l = Y,^Uu,t)ii-l) + l 

j=0 i>0 

H-1 

\R{u,t)\ = {ctiuj) -ij+i). 

j=0 

Let ui = < U2 < ■ ■ ■ < u\t\ be the nodes of t, in increasing lexicographic order. Then 

Ht{k) = \uk+i\ and St{k) = \R{uk,t)\ + ct{uk) - 1, (7) 

so that the discrepancy between Ht and St can be accessed using the number of nodes to the right of the 
paths to Ui,i = 1, . . . , This observation lies at the heart of our approach. 

The set of plane trees with degree sequence s and a distinguished node (marked plane trees) is denoted 
by Tg = {(t, ti) : t G Tg, u G t}, and the uniform distribution on this set is denoted P*. Under P*, a marked 
tree {t, u) is distributed as {t\ u') where t' is a tree sampled under Pg and u' is a uniformly random node 
in t'. We now decompose a marked tree [t, u) along the branch [0, nj. First, consider the structure of this 
branch, that we call the contents: 

Cont(t,u) := ((Q(no),«i), . . . , (ct(u|„|_i),i|„|)). 

We write for the set of potential vectors Cont(t, u) when the composition of the branch [0, v\ is 
M(ii, t) = m. Besides, notice that 
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Since, if Cont(u, t) G J™ then | LR{u, t)\ = 1 + X]j>o(^ — l)"^?, we will use the following notation: 



|LR(m)|:=l + 5^(z-i; 



mi. 



i>0 



The forest off a distinguished path. For a tree t and any node v G t, let tv = {w : vw G t} be the 
subtree of t rooted at v. The sequence of trees F(t, u) = {ty,v G LR(n, t)) is the forest constituted by the 
subtrees of t rooted at the vertices belonging to LR{u, t), and sorted according to the rank of their root for 
the lexicographic order. 

The decomposition which associates (Cont(t, u), F(t, u)) to a marked tree {t,u) is clearly one-to- 
one. The following proposition characterises the distributions of M(u, t), Cont(u,t), and |R(u, t)| when 
(t,u) is sampled under P*. In the following, for two sequences of integers s = (no,ni,...) and 
m = (mo, mi, . . . ) we write s — m = (no — mo, ni — mi, . . . ). 

Proposition 4. Let s = (no, ni, . . .) be a degree sequence and let m = (mo, mi, . . .) be such that mo = 0, 
and mi < riifor any i > 1. Let (t, u) be chosen according to P*. 
(a) We have 

|LR(m)| |m|! |s — m|! 



^•(M(u,t) =m) 



s ! s — m 



n 

i>l 



(b) Moreover, for any vector C G J^, 

P* (Cont(u,t) = C I M(u,t) = m) = 1/#J'^ 

(c) For any x > 0, and m such that Pg(M(u, t) = m) > 0, 



Rfu,t) 



u 



> X 



M(u,t) = m 



j>i fc=i 



> X 



(9) 



where the C/j*^'' are independent random variables, U^^'^ is uniform in {0, . . . , j — 1} and where is the 
variance associated with (pi = nj/|s|, i > 0) (as done on (2)). 

Proof. Since the backbone decomposition is a bijection, we have for any vector C G J™, we have 



(Cont(u,t) = C) 



|s| • #Fs 
|LR(m)| 



s — m 



|s — m| \{ni — rrii, i > 0) J / v(nj, i > 0)^ 
by the expression for the number of forests in (5). As P* (Cont(u, t) = C) is independent of C G J™, it 
suffices to multiply by #J™ in order to get P* (M(u,t) = m). After simplification, this yields the first 
statement in (a), and then (b). Now, (b) implies that for any R > 0, and any composition m for which 
P» (M(u, t) = m) > 0, we have 



^:(|R(u,t)|=i?|M(u,t) = m) 



, j>i k=i 



(fc) 



R 



where the C/j'^^ are independent random variables, and Uj'^' is uniform in {0, ... ,j — 1}. This implies 
assertion (c) and completes the proof. □ 



r(fc) 
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4 Convergence of uniform trees to the CRT: Proof of Theorem 3 



4.1 The general approach 



Our approach uses the phenomenon observed in Marckert & Mokkadem [37] in the case of critical 
Galton-Watson tree (having a variance): under some mild assumptions the Lukasiewicz path St and the 
height process Ht are asymptotically proportional, that is, up to a scalar normalisation, the difference be- 
tween these processes converge to the zero function. It turns out that a similar phenomenon occurs when the 
degree sequence is prescribed, and this is the basis of our proof. 

In order to prove Theorem 3 we proceed in two steps: the first one consists in showing that the depth- 
first walk St associated to a tree sampled under Ps(k) converges to a Brownian excursion. The process St is 
much easier to deal with than Ht, since St is essentially a random walk conditioned to stay non-negative, 
and forced to end up at the origin (precisely at —1). We provide the details in Section 4.2 below. The core 
of the work lies in the second step, which consists in proving that rescaled versions of St and Ht are 
indeed close, uniformly on [0, 1]. More precisely, by Theorem 3.1 p. 27 of [19], the following proposition 



is sufficient to show that ^''^2S't(nK-) and ^'^cT^^?t((iK — !)•) have the same limit in (C[0, 1], 

1/2 

Proposition 5. Under the hypothesis of Theorem 1, there exists = o(nK ) such that, as k ^ oo, 



1/2 9; 



s(k) sup 
\a;e[0,l] 



5t(xn«)-^ift(x(n,-l)) 



0. 



In order to prove Proposition 5, recall the representations of St and Ht in terms of |R(u, t)| and \u\ 
given in (7). A non-uniform version of the claim in Proposition 5 is the following: 

Proposition 6. Assume the hypothesis of Theorem 1. Let (t, u) chosen under IPs(^)- There exists = 

1/2 

o(nre ) such that, 



■ s{k) 



R(u,t) 



0. 



Again, by (7), one sees that 

|Rfu,t) 



St(u) 



'-Htiu-l) 



< A,, 



and Ak = o{^/n^), by assumption. Therefore, Proposition 6 implies then that the proportion of indexes 
m G [O, n^]] for which 

2 

St(m + 1) - ^Ft(m) >c«-Afc 

goes to (we will choose such that = o(ck)). In this case, if the sequence of processes (D^ := 
nK^^'^{St{xnK) — ^Ht{x{nK — l))), k> 1) is tight, we can deduce the convergence of the finite distributions 
of (Dk, k > 1) to those of the null process on [0, 1]. Hence, to show Proposition 5, it suffices to show 
Proposition 6 together with the tightness of (D^, k, > 1) ; the tightness is actually also needed to show the 
convergence of in distribution in C[0, 1] (see [see, e.g. 19]). Since under the sequence of distributions 
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IP's(re)' the family of rescaled versions of St (see Section 4.2) is tight, it suffices to prove that the family of 
rescaled versions of Ht is tight as well. 

We need also to say a word about the fact that both processes St and Ht have a small difference in 
their time rescaling. Again, this is not a problem since the process St has its increments bounded by = 

Remark. Under slightly stronger assumptions on the degree sequences, it is possible to control the dis- 
crepancy between the height process and the Lukasiewicz path at every point in {0, 1, . . . , — 1}. More 
precisely it would be possible to show that 

' >cj =o(l/n,). (10) 



s(k) 



R(u,t)|-^|u| 



Using the union bound, this yields the convergence of the rescaled height process to a Brownian excursion, 
as a random function in C[0, 1]. One is easily convinced that with the optimal assumptions for Theorem 3, 
the bound in (10) might just not hold. 

We now move on to the ingredients of the proof: we first give the details of the convergence of 
H/t St{ ■ n^) to a Brownian excursion in Section 4.2, then we prove tightness for Ht (• (n^ — 1)) 
in Section 4.3. The longer proof of Proposition 6 is delayed until Section 5. 

4.2 Convergence of the Lukasiewicz walk 

In this section, we give the details of the proof of the convergence of the depth-first walk under Ps{k) 
towards the Brownian excursion. 

Lemma 7. Assume the hypothesis of Theorem 1. Under Ps(k)' 

/ 5t(xnK) \ (law) 



_ nl/2 / K— !-+oo 



> e 



as random functions in C[0, 1]. 



Proof. Let c = {ci, C2, . . . , Cn^} be a multiset of integers whose distribution is given by s(k). Let 
TT = (vTi, 7r2, . . . , VTn^) bc a Uniform random permutation of {1, 2, ... , n^}, and for j G {1, . . . , n^}, define 

3 

1=1 

Theorem 20.7 of Aldous [5] (see also Theorem 24.1 in [19]) ensures that, when = o(y^), 

fW^isn^)] (law) 



1/2 



se[o,i] 



in C[0, 1], where b = (b(s), s G [0, 1]) is a standard Brownian bridge. 
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The increments of the walk {WTr{j),0 < j < n^) satisfy Ct^- — 1 > —1 for every i (such walks are 
sometimes called left-continuous), and furthermore, Wnin^) = —1. The cycle lemma [23] ensures that 
there is a unique way to turn the process W-^ into an excursion by shifting the increments cyclically (in each 
rotation class there is a unique excursion) : to see this, first extend the definition of the permutation, setting 
TTj := TTj-n^ for any j G {n^ + 1, . . . , 2ni^}. For jj^ the location of the first minimum of the walk W-^ in 
{l,...,n«;},we have that W-,r{j + jn) — W^ttOtt) is an excursion in the following sense: 

SAJ) ■■= W^U + J-) - W^Un) > for j < n« and ^^(n,) = -1. 

Since in each rotation class there is exactly one excursion, and since the set of excursions hence obtained is 
exactly the set of depth-first walk of the trees in Ts(^), it is then easy to conclude that for t uniformly chosen 

{St{j),0< j < n,) ^ (S^(j),0 < j < n,), 

for vr a random permutation of {1, . . . jH^}. Since the Brownian bridge b has almost surely a unique 
minimum, the claim follows by the mapping theorem [19]. □ 

4.3 Tightness for the height process 

— 1/2 

The rescaled height process under Fs{k) is the process in C[0, 1], /i^ = H{ ■ (n^ — 1)). In 

this section, we prove that the family {h^, k > 0) is tight (we will omit the k when unnecessary). Since 
^k(O) = 0, the following lemma is sufficient to prove tightness [see, e.g., 19]. 

Let ujh be the modulus of continuity of the rescaled height process h: for 5 > 

ujhi5) = sup \h{s) - h{t)\. 

\t-s\<S 

Lemma 8. Under the hypothesis of Theorem 1, for any e > and t] > 0, there exists 6 > such that, for 
all K large enough, 

Ps(k)K(5) > e) < ??. 

The bound we provide consists in reducing the bounds on the variations of h to bounds on the variations 
of the Lukasiewicz path S, which is known to be tight since it converges in distribution (Lemma 7). The 
underlying ideas are due to Addario-Berry et al. [4] and Addario-Berry [1] to prove Gaussian tail bounds 
for the height and width of Galton-Watson trees and random trees with a prescribed degree sequence, 
respectively. 

For a plane tree t G T, let be the mirror image of t, or in other words, the tree obtained by flipping 
the order of the children of every node. Then, we let := S^- be the reverse depth-first walk. Observe 
that the mirror flip is a bijection, so that St and have the same distribution under Ps(k)- 

Proof of Lemma 8. In this proof, we identify the nodes of a tree t and their index in the lexicographic order; 
so in particular, we write Ht{u) for the height of a node u in t, and we write |u — f | < 5n^ to mean that 
u and V are within Jn^ in the lexicographic order (that is, u = Ui and v = Uj for some i and j satisfying 
\i- j\ < <Jn«;). 
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Consider a tree t and two nodes u and v. Write « A f for the (deepest) first common ancestor of u and v 
in t. In tlie following we write u ^ v to mean that u is an ancestor ofvmt{u = vis allowed). Then, 

\Ht{u) - Htiv)\ < \Ht{u)- Ht{uAv)\ + \Htiv)- Ht{uAv)\, (11) 

so that it suffices to bound variations of Ht between two nodes on the same path to the root: 

sup \Ht{u) - Ht{v)\ < 2+2 sup \Ht{u) - Ht{w)\. 

(The extra two in the previous bound is needed because of the following reason: the closest common 
ancestor uAv might not be within distance Sn^of either u and v; however, there is certainly a node w lying 
within distance one ofuAv that is visited between u and v.) Now, observe that, for w ^ u, every node v on 
the path between w and u which has degree more than one contributes at least one to the number of nodes 
off the path between w and u: 

1+ E ict{v)-l)>Ht{u)-Ht{w)- l{c.(.)=i} 

However, one may also bound this same number of nodes in terms of the depth-first walk St, and the reverse 
depth- first walk : 

1+ J2 {ct{v)-l)<St{u)-St{w) + S^{u)-Stiw) + 2ctiw). (12) 
In other words, we have 

sup \Htiv) - Ht{u)\ < 2+2 sup |5t(n) - Stiw)\ + 2 sup |5,-(n) - Sriw)\ 

\u—v\<5nK \u—w\<Sr\fi,w^u \u—w\<Snfi,w:<u 

+ 2maxci(w)+ sup V] l!ct(v)=i} 
< 2+2 sup \St{u) -St{w)\+2 sup \S^{u) - {w)\ 

\u—w\<5n \u—w\<5nK 

+ 2Ak+ sup Y l{ct{i,)=i} 
<2+2ny2,^,(5) + 2nyV-(5) + 2A«+ sup ^ l{c,(.)=i}, (13) 

— \/2 —1/2 

where ujs and a;^- denote the moduli of continuity of the rescaled Lukasiewicz path St and S*^ , 
respectively. 

The first four terms in (13) are easy to bound since = o{^/n^) and, after renormalisation, St and 
S^ are tight under Ps(k)- The only term remaining to control is the one concerning the number of nodes of 
degree one: 

Yt{5) := sup Y ^{ct{v)=i}- 

\u-w\<Sn^ ^^^^^ 

To bound Yt{6) we relate the distribution of trees under Ps(k) to those under Ps(/t)*> where s(k)* = 
(nQ,n*, . . . ) is obtained from s(k) by removing all nodes of degree one, i.e., = and n* = Ui for 

12 



every i ^ \. Then, in a tree t* sampled under IPs(k)*' oii^ has Yt*{S) = 0. Recall also that = o{y/n^). 
Now, for a sum of three terms to be at least e, at least one term must exceed e/3. So for every e, 5 > 0, there 
exists a 5 > such that, for all k large enough, 

PsW*K(<^) > e) < PsM*(2a^s(5) > e/3) + P,(,)*(2a;,_(5) > e/3) 
= 2P,(,)*(6^^,(5) >e) <r/, 



since, under IPs(k)*, St and have the same distribution and St is tight, and since Ps(k)* (^k^k 

— 1/2 

e/3) is zero for k large enough. This proves that Ht is tight under Ps(k)*- 

Now, we can couple the trees sampled under Ps(k)* ^rid Psl/^)- Since the nodes of degree one do not 
modify the tree structure, a tree t under Ps(k) niay be obtained by first sampling t* using Ps(k)*' ^^id then 
placing the nodes of degree one uniformly at random : precisely, this insertion of nodes is done inside the 
edges of t* (plus a phantom edge below the root). Given any ordering of the edges of t* (plus the one below 
the root), the vector {X^, . . . , X* of numbers of nodes of degree one falling in these edges is such 

that 

{XI, Xl_^^^^,) £ Multinomial (n,(K); ^^^^^^^y • • • , ' 

Conversely, t* is obtained from t by removing the nodes of degree one, so that t and t* can be thought 
as random variables in the same probability space under Ps(k). To bound Yt{6), observe that it is unlikely 
that adding the nodes of degree one in this way creates too long paths. 

In fact, "the length of paths" is expected to be multiplied by 1 + for Qk = ni{K)/{n^ — ni{K)). Let 
a = 2+gK>andfix5 > such that Pg(^)* (cjft(5) > e/a) < r7/2;sucha5 > exists since the height process 
is tight under Ps{k)*- Note that since we add nodes in the construction of t under Ps(k) from t* under Ps(k)*> 
nodes that are within Jn^ in t are also within dn^ in t*. Write h* for the rescaled height process obtained 
from t*, the tree associated with t by deletion of all nodes of degree one (the rescaling stays \/n^). We have, 

Ps(.)('^h('J) > e) < Ps(«)(wh*(5) > e/a)+F,(^^)iuhid) > e , Uh*id) < e/a) 
< FsiK)*M6) > e/a)+F,^,){uh{6) > e | uJh*{S) < e/a) 

j2 a+x:)>e^, 

X,>e^{l-l/a) 

where the Xi are i.i.d. Binomial(ni, l/(nK — rii)) random variables. The last line follows from the standard 
fact that the numbers (X^) obtained from a sampling without replacement (of the ni{ti) nodes of degree 
one) are more concentrated than their counterpart {Xi) coming from a sampling with replacement [5]. 
Now, the sum in the right-hand side is itself a binomial random variable: 



> 



Xi = Binomial { e^/n/^ni/ a, ^ ) 

1=1 ^ 
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whose mean is e^Jukq^i/ (2 + q^) when e-^/n^(l — 1/a) = £y/n^{l + q^) /(2 + g^)- By Chemoff's bound, 
using that converges, it follows that for some constant c > valid for n large enough, 

Ps(K)K('5) > e) < r//2 + ^n^e-^^A. 

Finally, for all k large enough, with this value for 5, we have lPs(K)('^/i('5) > e) < which completes the 
proof. □ 



5 Finite dimensional distributions: Proof of Proposition 6 



5.1 A roadmap to Proposition 6: identifying the bad events 

Our approach consists in showing that if the event in Proposition 6 occurs, then one of the following 
three events must occur: (1) either the depth |u| of node u is unusually large, (2) or the content of the branch 
\0, uj is atypical, (3) or the number of nodes to the right of the path is not what it should be, despite of the 
length |u| and content M(u, t) being typical. 

We will then prove that those simpler events are unlikely. For h > 0, and two sequences a = (a^, k > 
0), and 6 = (6k, K > 0) we define families of sets Ah^a,b as follows. Given a sequence of degree distribution 
(s(k),k>0). 



A 



h,a,b[H) 



m : m 



h, 




2 



i>l 



If m G Ah^a,bi>^) then |m| = h, and m corresponds to the content of a branch [0, uJ such that \u\ = h. 
The set Ah^afii^) are designed to contain most typical contents of a branch of length h under Ps(k)' provided 
the choices for the sequences a and b are suitable. The decomposition of the bad event we have outlined 
above is then expressed formally by 



|R(u,t)|-^|u| 



>Ck) <Fl^,){\u\>x^) 

+ P:(,)(|LR(u,t)| >x>^) 

+ P*(,) |U| V |LR(u,t)| < X^,M{u,t) i y Ah,a)^{K) 



h<x^/n 



h<.Xy/r\n 



Rfu,t) 



u 



> CK,M(u,t) = m 



(14) 



Proving Proposition 6 reduces to proving that every term in the right-hand side above can be made arbitrarily 
small for large k by a judicious choice of a^, b^, and x. The bound on the first term is a direct consequence 
of the Gaussian tail bounds for the height of trees recently proved by Addario-Berry [1] in the very setting 
we use: 

P*(k)(|u| > x^) < P,(k) (max|u| > x^j < exp{-cx^ /a^), (15) 
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for a universal constant c > and all sufficiently large k. The second term is bounded using the depth-first 
walk S and the reverse depth-first walk , as in the proof of Lemma 8: 

P*(^)(|LR(u,t)| > x^H^) < ( max {S{k) + S~{k)} + A, > x^) 
for all K large enough, since = 0(11^) and S and S~ have the same distribution under Ps(k). We finish 



— 1/2 

using the tightness of Dk ^(nfi;.) under Ps(k); more precisely, we have 



:(,)(|LR(u,t)| >xVH;)<16-9 



(7t 



(16) 



by Lemma 20.5 of [5]. The bounds on the two remaining terms are stated in Lemmas 9 and 10, the proof of 
which appear in Sections 5.2 and 5.3, respectively. 

Lemma 9. Since = o(\/nK) there exists such that A^ < Ek-^/tu, with < — 0. Let = 

1^. Then, for every x > 0, and all k large enough, 



^•(^) I |u| V |LR(u,t)| < Xy/^,M{u,t) U Ah,a,b{^) I < 6x^6^ exp 



e, 



-1/2 



2x(a2 + l) + 2 i ■ 



Lemma 10. Since Afc = o{^/n^) there exists such that A^ < Si^^/n^, with and e^^^^ 

o{nf^) as K ^ 00. Let = e^^y^, = e^/'^n^, and = el/^^,/n^. Then, for all k large enough. 



IRfu.tl 



u 



> c«;,M(u,t) = m < 2e 



-1/2 



(17) 



Before proceeding with the proofs of these two lemmas, we indicate how to use them in order to com- 
plete the proof of Proposition 6. Let be such that A^ < e^y^, with — as k — )• 00. Then, set 



Ik = eK^^A/^K' ^« ~ Ek'^^Ik and = el/^y/n^. Let now e > be arbitrary. Pick x > large enough such 
that, for all k large enough, 

PsV)(|u| > x>;) +P:(,)(|LR(u,t)| > x^) < e/2. 

The bounds in (15) and (16), and the fact that a'^ — )• ensure that this is possible. The value for x being 
fixed. Lemmas 9 and 10 now make it possible to choose kq large enough such that, for all k > kq, the two 
remaining terms in the right-hand side of (14) also sum to at most e/2. Thus, for all k > kq, we have 



■ s(k) 



|R(u,t) 



u 



> Ck < e, 



which completes the proof, since e was arbitrary. 
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5.2 The content of a branch is very Ukely typical: Proof of Lemma 9 

We now prove that, on the event that |u| and |LR(u, t)| are not too large, the content of the branch [[0, uj 
is typical with high probability. 

We start by rewriting the probability of interest using Proposition 4: 

p:(^) |u| V|LR(u,t)| <xVH;:,M(u,t) J Ah,aM 
= Yl Ki.)i\^\=h,\\-R{n,t)\<x^,M{u,t) ^Ah,aM^)) 

h<x,Jn^ 



E E P:(,)(|u|=/i,M(u,t)=m) 



h<xjn^ |m|=h 

"i^^h a 6(«).lLR(m)|<x^ 

M^iMn (:•).... 

h<Xyfn^ \m\=h l>l ^ ' 

'"^^h,a,6('^)'l'-R(m)|<a;^ri77 

where, for short, we have written rij instead of UiiK). We now reduce the right-hand side to an expected 
value with respect to multinomial random variables. Let (Pi, i > 1) be multinomial with parameters h and 
{ini/{nK — 1), i > 1). Then, for any m = (0, mi, m2, . . . ) such that |m| = h, we have 

Now, since (1 — x)^^ < exp(2x) for |x| < 1/2, we have for all h < x^/n^, and all k large enough. 



1=0 ?=0 



Note also that, for every i > 1, we have n,! < ' (nj — mj)!, so that, rewriting (18) in terms of events with 
respect to {Pi, i > 1), we obtain 

P*(^) |u|V|LR(u,t)|<x^,M(u,t)^ y Ah,a,b{^) 



h<Xx/n 



|LR(m)| {n,-h)\{^^-lf 



E E '^• ^'""'^T^ ^ n ' ,, -IP((P.^>l) = (^.^>l)) 



^ E -7^/ E ip((^^>^>i) = K,^>i)) 

h<Xy/n^ ^ |m| = h 

< 2X^6^' sup P((P,i > 1) A,a,6(K)). 



Now, we decompose the set of m in the right-hand side so as to obtain bad events that are individually 
simpler to deal with 

P:(,)(|u|V|LR(u,t)|<xVH;,M(u,t)0A,a,6('t))<22;2e^' sup (Ci + C2) 

h<x^/n 
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where 



Ci 



i>l 



i - 1 



2 



>aA and C2 = ^ [Y^i^Pi > 



i>l 



We now bound the terms (,1 and ^2 individually. 
The first term Ci- Observe first, that 



E 



i>l 



i - 1 



hal 
2 ' 



so that bounding Ci consists in bounding the deviations of (a function of) a multinomial vector. However, 
one can write 

i>i j=i 

where Bj, j = 1, . . . , h, are i.i.d. random variables taking value {i — l)/2 with probability ini/{n^ — 1), for 
z > 1. Now, the sums X]^=i(^i ~ ^[-^j])' ^ = 0, 1, . . . , /i, form a martingale. We bound their deviations 
using a concentration inequality from [38] (Theorem 3.15), which says that if 5 is a sum of independent 
random variable Xi + • • • + X„ such that E(S') = fi, var(S') = V, and if for all k — E(Xfc) < b, then 
P(5' - fi>t) < e-*''/(2^(^+^*/(3^)). The variance of Bj may be bounded as follows: 



var(5,) < E[S|] = 5: < A. 5: 

j>i j>i 



2 — 1 iUi 



for all K large enough. Now, since max{|Sj — E(i?j)| : j = 0, . . . , /i} < A^, one has, for h < x^/n^, 

h 



YiBj-EB, 



for all K large enough, since = ^^^y^ = o(y^). It follows that, for every h < Xy^, we have 



> O/t I < 2 exp 
< 2 exp 



2/iA«cT2/4 + 2A«a J3 

„2 



Xi/h^^AKcr^ 



Ci = sup 

h<x^/r\K 



i>l 



i - 1 



-1/2^ 



> Ok < 2 exp 



(19) 



The second term ^2- We bound C,2 using the idea we used when bounding C,i: one can express the 
event in terms of independent random variables Bj, j = 1, . . . , /i, where Bj takes value with probability 
zni/(nK — 1). Observe first that 



E 







h 




= E 




i>l 







^E^ 



i>l 



IIk, - 1 



< h^^{al + 1) 
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So, we have 



i>l 



h 



<f\^{B,-E[B,])> 



2 



1/2 

for all K large enough, since /lA^ < xe^nK = o{eK n^) = o(6k). The right-hand side above can be 
bounded using the martingale inequality in [38] (Theorem 3.15). We note that the variance of Bj satisfies 



var{Bj)<E[B]] = Y,'' 



<A^(ct2 + 1). 



Since max{|i?j| : i = 1, . . . , /i} < A^, it follows by McDiarmid's inequality that 



C2 < P [y1{B, - E[i?,]) > y j < exp (- 



< exp 



exp 



2x^A3(a2 + l) + 2A26,/3 

hj, 

'2(x(a2 + l) + l/3)A2 

-3/2 \ 



(20) 



2x{al + l) + 2/2,j' 

for all K large enough, since A^y^ = 0(6^). 

To complete the proof, it suffices to combine the bounds in (19)-(20), and observe that they imply the 
claim for k large enough, since the upper bound in (20) is much smaller than the one in (19). 



5.3 The structure of a branch with typical content: Proof of Lemma 10 

Finally, we consider the probability that the structure of a branch is not what one expects, in spite of the 
length and content being close to the typical values. The left hand side in (17) is bounded by 



sup F^(^) 



|R(u,t)| - y|U| 



by Proposition 4 (3), where C/j*^^ are independent random variables with U'^'^' uniform on {0, 1, ... ,j — 1}. 
By the triangle inequality, the quantity in the right-hand side above is at most 



M(u,t) = m 



sup r 



(k) 



j>l k=l 



sup 



E 



nii- 



i-i 



j>i k=i 



azh 



i-1 



(21) 



By definition of A/j.^ ;,(«), and since > 20^ for all k large enough, the quantity in (21) is bounded by 



sup 

h<Xy/n^ 



E 



i-1 



j>i k=i 



(fc) 



> 
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(k) 

Now, since all the random variables Uj , j > 1, k = 1, . . . , are symmetric about their respective 
mean (j — l)/2, one obtains using Chernoff's bounding method 



j>l j>l k=l 



(k) 



t>o ii Vjsinh(V2); 



i>i 



24 24 2880 y 



< 2 inf exp ( — t— + y^m,- 



t2 



24 48 



(22) 



< 2 inf exp ( —t— H y^m,-? 

" te{o,i) I 2 24 



Here the third line follows from the bounds log(sinh(s)) < log(s) + and log(sinh(s)) > log(s) + 
- s^/180 vahd for s > 0. Finally, we obtain 



sup 



E-.^-EE^f 



2 J 

j>l A:=l 



> — < 2 inf exp -t— + 
' te(o,i) 



24 



upon choosing t = Qci^/hK, which is indeed in (0, 1) for n large enough (we restricted the range of t in (22)). 
This completes the proof since 3c^/ (26^) = ^^^/2 > for all k large enough. 



6 The limit of rescaled Galton- Watson trees: Proof of Proposition 2 

Consider the family tree of a Galton-Watson tree t with offspring distribution /i = (/Xj, i > 0) starting 
with one individual. Let be the probability distribution of t. Denote by St := (nj(t), i > 0) the empirical 
degree sequence of t, let 

fii = ni(t)/|t|, 

2 _ „-2 '^j(t) 



j>0 



Itl - 1 



A = max{i : nj > 0}. 

Note that is not the variance of the empirical distribution (/ii, i > 0) but has been chosen to be consistent 

''s(k) 



with the definition of a"^, in (2). Write P"( • ) = • | |t| = n). In what follows, all the assertions 



containing " Pj]" are to be understood "for n such that P^(|t| = n) > 0"; similarly, the limit with respect to 
Pj] are to be understood in the same manner, along subsequences included in {n : P^(|t| = re) > 0}. 
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Lemma 11. Assume that /x has mean 1 and variance o"^ G (0, +oo). Then under PJ], 

[fi,^^A/^)^{^l,a%^), (23) 

n ^ 

where the convergence holds in the space M (N) x M x M equipped with the product topology. 

In this lemma, 7W (N) is the set of probability measures on N. The topology on Ai (N) is metrizable, for 
example, by the distance 

i>0 

where is the distribution of the ith first marginals under v and dxv is the distance in total variation. 
Since here the limit is the deterministic measure ^i, it suffices to show that, for all i, jli — )• jn in probability 
as n — )• oo. With D it is easy to construct a metric on A^(N) x M x M making of this space a Polish space. 
Hence, by the Skohorod theorem there exists a probability space where versions of {fi, a"^, A/y^) under P^ 
converges almost surely to (/x, o"^, 0). So on the conditional space, the hypotheses of Theorem 1 hold almost 
surely, and then its conclusion, which is a limit in distribution, also holds. Of course, we do not mean that 
any sequence of trees for which the degree distribution satisfies the conditions of Theorem 1 converges to the 
continuum random tree; one also needs that for any fixed k, conditional on the degree sequence s the trees 
are distributed according to Pg. This fact certainly holds for conditioned Galton-Watson trees: under P^ all 
trees with the same degree sequence occur with the same probability, and conditional on its degree sequence 
s, a Galton-Watson tree is precisely distributed according to Pg. To summarise, to prove Proposition 2 it 
suffices to prove Lemma 11. 

Proof of Lemma 11. The claim is about properties of the degree sequence of Galton-Watson trees con- 
ditioned on their total progeny. We first provide a way to construct the degree sequence. Consider the 
Lukasiewicz walk 5„ associated with a tree t under P^; the degree sequence of the tree t is essentially (just 
shift by one) the empirical distribution of the increments of S'„. More precisely, consider first a random 
walk W = {Wk, k = 0, . . . ,n), with i.i.d. increments = Wk — Wk-i, k = 1, . . . ,n with distribution 

Ui = F{Xk = i) = fj,i+i i > -1; 

then S = {So, . . . , 5„) is distributed as W conditioned onW € A^iin) where 

Atiin) = {w = (wq, . . . ,Wn) ■■ wq = 0, Wk > 0,1 < k < n,Wn = -1} 

is the set of discrete excursions of length n. 

Write Ki = #{k : Xk = i - 1}, and K = {Ki,i > 0). Then, if G ^tiin), the sequence 
K = {Ki,i > 0) is distributed as the degree sequence of a tree under P^. In other words, we have 

¥{K G 5 I G ^li(n)) = ¥l{{ni{t),i> 0) G B). 

By the rotation principle, we may remove the positivity condition : 

nK(^B\Wn = -1) = K{{n^{t),l > 0) G B). 
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Our aim is now to show that the condition that is a bridge imposed by Wn = — 1 does not completely 
wreck the properties of W in the following sense: let = <7(Wo, . . . , Wk) be the cr-field generated by 
the k first Wi', then there exists a constant c G (0, oo) such that for any n large enough, and for any event 



B £Ti 



[n/2\ 



one has 



¥{B \Wn = -l)< cF{B). (24) 

That is: any event B in -^[n/2j with a very small probability for a standard (unconditioned) random walk 
also has a small probability in the bridge case (conditional on Wn = —1). The argument proving this claim 
is given in Janson and Marckert [29], page 662 and goes as follows: 

nWln/2\=X,Wn = -l) 



¥{B I Wn 



Y,nB\W^n/2\=X,Wn = -l 

X 

Y,nB\W^n/2\=x)nWln/2\ 



FiWn = -1) 
nWn-ln/2\ =- 



1) 



nwn 



-11 



It then suffices to (a) observe that sup^ P( W„_ |^„/2J = ~x — 1) < cj \/n for some constant c\ G (0,oo) 
[41, Theorem 2.2 p. 76], and (b) use a local hmit theorem to show that = — 1) > C2\Jn, for some 

constant C2 E (0, oo) and all n large enough [25, page 233]. This gives the result in (24) with c = ci/c2. 

Now using that the increments (Xi, . . . , X„) under P( • | Wn = —1) are exchangeable, any concentra- 
tion principle for the first half of them easily extends to the second half (the easy details are omitted). Con- 

1/2 

sider the degree sequence induced by the first half of the walk: let K- = ^{k : Xk = i — I, k < [n/2j}, 

1/2 

and note that the K- ' are J^^„/2j"nis^surable. For W (that is, with no conditioning), we have 

1 ^,1/2.. ^.o 1 



Ln/2J 



ln/2\ 



[n/2\ 



<7„ 



(25) 



i>0 ^ j=l 

by the law of large number, since Xi owns a (finite) moment of order 2. Hence, for any e > 0, writing 



Ev{e) 



\n 2\ ^ * ^ 

L ' i>0 



1)^ 



> e 



we have f[Ev{e)) — )• and thus, according to the bound in (24), f{Ev(e)\Wn = 
Using the argument twice (one for each half of the walk) yields convergence ct^ 

n — 7- oo. 

The same argument also proves that 

^1/2 



1) — )• 0, as n — )• oo. 
(T^ in probability as 



Ln/2J 



> e 



Wn 



-1 



0, 



which yields /Tj — in in probability. 

The fact that A = o{^\/n) (in probability) under PJ] is also a consequence of the convergence of the sum 
given in (25). To see this, let C(a) = {k : P(X2 > ) > a/A;}. Since EfX^] = Y.k>Q^^^\ > k) < +oo, 
then kF{Xl > k) ^ , entailing #C{a) < +oo for any a > 0. In particular, for any e > 0, 



#{n : nF{Xf > en) > a/e} < +oo. 
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Taking a = se', one obtains that ^{n : nF{Xf > en) > e'} < +00, which implies that 

P(max{Xi :i<n/2}> e^) < nP(Xf > en) > 0. 

71— >00 

So under the unconditioned law one has A = o(\/n); we complete the proof using the bound in (24). □ 

7 Application to constrained coalescing processes 

In this final section, we discuss an application of Theorem 1 to a coalescence process with particles 
having constrained valences. 

The famous additive coalescent [10, 15, 16, 42, 43] can be seen as arising from the following natural 
microscopic description. Consider a set of n distinct particles {1, 2, . . . , n}. The particles are initially free, 
and form n clusters; the clusters are organised as rooted trees. The clusters merge according to the following 
dynamics. At each step, choose a particle u uniformly at random; it belongs to some cluster T rooted at r. 
Choose uniformly a second cluster T' ^ T, with root r' . Add an edge between r' and u to obtain a new 
cluster rooted at r. At each step, the system consists of a forest of general rooted labelled trees (an acyclic 
graph on {1, 2, . . . , n} with a distinguished node per connected component). The process stops after n — 1 
steps, when the system consists of a single rooted labelled tree. The final tree is then uniform among all 
rooted labelled trees. 

One can similarly define a system of coalescing particles where the degrees would be constrained. 
Different algorithms might be used, depending on the precise way the uniform choices are made, that yield 
a priori different trees. 

Labelled particles. Consider the set of particles {1,2,..., n}, and a set of degrees ci < C2 < • • • < Cn. 
Write s = (nj,i > 0) for the associated degree sequence, nj = : Cj = i}. Assign randomly the 
particles a degree. For instance, this can be done using a random permutation a = (o"(l), . . . ,cr{n)) of 
{1, 2, . . . , n} and assigning degree Co-(j) to particle i. Think now of the particle i as initially having edges to 
Co.(i) free slots that can each contain a single particle. The particles will now merge to form clusters. Each 
cluster is represented by a tree with a distinguished vertex (the root). Initially, each particle sits in a tree 
containing a single node (which is then also the root). Proceed with the following algorithm to merge the 
particles, as long as there are free slots left: 

• Pick a free slot s uniformly at random; say it is bound to particle p lying in the cluster rooted at r. 

• Pick another cluster, uniformly at random, rooted at some node r' . 

• Merge the two clusters by assigning r' to the free slot s; this creates an edge between the particles p 
and r', and removes the slot s from the set of free slots. The new cluster is rooted at r. 

At every iteration, precisely one slot is filled and the process stops after n — 1 steps. The process yields a 
random tree labelled tree T^. 

The labelled tree is uniform in the set of labelled trees having the same specified degree sequence. 
To see this, just consider the encoding of the process by the final labelled tree, together with a labelling of 
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the edge indicating their order of appearance. At iteration i G {1, . . . , n — 1}, there are n — i free slots 
left and n — i + \ connected components, so that the probability that any couple free slot/other connected 
component is precisely 

1 

(n — z)2 

Overall, the probability to obtain any particular pairing free slots/particles together with a history is 

" 1 1 

n (n-i)2 " (n- 

The same particle adjacency — hence the same labelled tree — is obtained by the 0^=1 '^j- ways to pair 
the free slots with particles; and for any labelled tree there are exactly (n — 1)! distinct histories. Finally, 
among the n\ ways to assign the labels to particles in the first place, ni>o correspond to the degree/label 
pattern of the tree, it follows that the probability of seeing any labelled tree after n — 1 iterations is precisely 

which depends only on the degree sequence, so that trees with the same degree sequence are chosen uni- 
formly. (This is also, as it should, the inverse of the number of labelled trees with degree sequence given by 
s = (nj, i > 0) [43, Example 6.2.2].) 

Unlabelled particles. Consider a degree sequence in the form of s = (nj, i > 0) where denotes the 
number of nodes of degree i. For ci < C2 <•••< Cn of size n. So X]j>o Ci = n — 1. As before, we think of 
the particles as having empty slots, but since there are no labels, we impose that the slots of any given particle 
be ordered. The particles then merge according to the same algorithm, in order to distinguish particles use 
the canonical labelling giving label i to the particle with degree q. After forgetting the canonical labelling, 
the process yields a plane tree T„. 

Again, the plane tree T„ is uniform among all plane trees with the correct degree sequence. The argu- 
ments are similar, only simpler, to those we used in the labelled case. Since, for a given plane tree, there are 
Wi>Q nil ways to assign the canonical labels to the nodes, the probability to obtain any given plane tree is 

TT'^i' X 7 N.o X (n — 1)! = n( , ,| 

ii (n-l)!2 ^ > \{ni,z>0)J 

In these coalescing particle systems, one of the parameters of interest is the metric structure of the cluster 
(structure of the "molecule") eventually obtained after all particles have coalesced into a single component. 
In the unrestricted case, the metric structure is described by the CRT of Aldous. Our result shows that 
the quenched version, conditional on the degree sequence, is also valid under reasonable conditions on the 
degree sequence imposed. Results for Galton-Watson trees conditioned on the size only are recovered by 
sampling the degree sequence. 

For instance, to recover the unrestricted version of the merging process, one can sample n independent 
Poisson(l) random variables, and keep them if their sum equals n — 1; the n exchangeable values obtained 
are then the degrees Ci, C2, . . . , C„ of the n particles. 
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